Astronomy & Astrophysics manuscript no. century 


February 2, 2008 


(DOI: will be inserted by hand later) 





o 
o 

(N 
-i— > 
O 

O 

(N 



Century scale persistence in longitude distribution: in the Sun 

and in silico 

J. Pelt 1 ' 2 , I. Tuominen 2 , and J. Brooke 3 ' 4 

1 Tartu Observatory, 61602 Toravere, Estonia 

2 Astronomy Division, Department of Physical Sciences, P. O. Box 3000, FIN-90014 University of Oulu, Finland 

3 Manchester Computing, University of Manchester, Oxford Road, Manchester, M13 9PL, UK 

4 Department of Mathematics, University of Manchester, Oxford Road, Manchester, M13 9PL, UK 

Received — ; accepted — 



> 
ON 

O 

o 

Oh: 

6 



Abstract. Using Greenwich sunspot data for 120 years it was recently observed that activity regions on the 
Sun's surface tend to lie along smoothly changing longitude strips 180° apart from each other. However, numerical 
experiments with random input data show that most, if not all, of the observed longitude discrimination can be 
looked upon as an artifact of the analysis method. 
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1. Introduction 

The time distribution of sunspot latitudes is well known 
and the ubiquitous butterfly diagram is becoming a kind 
of trademark in solar research. The same is not true for 
longitudes. It is still not known how persistent in time are 
different statistical features of longitude distributions for 
various activity indicators (see for instance Bai 2003 and 
references therein). The present letter is inspired by the 
recent contribution by Berdyugina & Usoskin 2003 (here- 
after BU). By using certain data processing techniques 
they built smooth curves which run along central longi- 
tudes of sunspot activity centres. They observed that dis- 
tribution of activity centre phases (if computed against 
mean flow) has a double peak distribution at a high level 
of confidence (see Fig. 5 in BU). They concluded that there 
is a century scale persistent feature in the sunspot longi- 
tude distribution: the local neigbouring maxima of activ- 
ity tend to lie 180°apart. They also observed several "flip- 
flop" events (see Jetsu et al. I1993fl and concluded that 
there is a strong analogy between the Sun and rapidly ro- 
tating magnetically active late-type stars (for an overview 
of active stars see Tuominen et al. 120020 . We can also 
formulate their result in metaphoric terms: the Sun has 
two faces, and as a kind of peculiar Janus it sometimes 
switches them back and forth. 

In this paper we analyse randomly generated distribu- 
tions to check the possibility that the observed effect in 
BU is a result of dependencies and correlations hidden in 
the statistical method itself. And indeed, we found that 



it is possible to obtain quite similar double-peak distri- 
butions for random data. Consequently the statistically 
significant part (or even all) of the effect described in BU 
can be safely ascribed to the artifacts of data processing. 
It is still reasonable to assume that the mean magnetic 
field on the Sun (at least in the regions of spot formation) 
is axisymmetric, when the mean is taken over sufficently 
long time. 

What follows is a short technical account of the anal- 
ysis done. 

1.1. Simulations for active longitude distributions 

The strongest argument in BU for the century scale per- 
sistence of active longitudes is given in their Fig. 5 where 
the two strong and symmetric distribution maxima indi- 
cate that sunspot groups tend to concentrate around two 
centres, 180° (0.5 in phase) apart. 

To check their result and its methodological under- 
pinnings we did some very simple numerical simulations 1 . 
Instead of using actual observational data we generated ar- 
tifical random data sets and tried to process the obtained 
distributions along the same lines as in the original paper. 
There was no need to simulate all data processing steps 
performed in BU. It was enough to start from the point 
where for each Carrington rotation the phases for activity 
maxima were computed. For simplicity we assume that 
for every rotation there are at least two activity maxima, 
each representing a sunspot group concentration. 
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Full source codes of relevant software can be downloaded 
from http:/ /www. aai. ee/^ pelt /soft. htm 
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Fig. 1. Fragment of the computer generated phases 0i(ti) 
(diamonds) and 0a(£i) (points). Thick line is a half year 
average of the principal maxima (M±(t)), thin line is a half 
year average of the secondary maxima (Ma(t)). The mean 
curves can cross each other. 
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Fig. 2. Distribution of the differences between phases 
4>i(ti), 4>2(ti) and the half year mean M 1 ^) for one par- 
ticular run (first from the series of 100). 
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Fig. 3. Mean distribution of the 100 separate runs. The 
distribution is flat in the centre. 

Each individual simulation run proceeded then as fol- 
lows. First, for each of the N = 1720 Carrington rota- 
tions with starting times ti,i — 1, ... ,7V two fully ran- 
dom and statistically independent phases 0i(ij), foiU)^ = 
1,....N were generated. The evenly distributed values 
<= (f>i(ti) < 1 and <= <j>2{U) < 1 were then used as 
simulated phases for principal and secondary maxima of 
sunspot density distribution along longitude for particular 
rotation t%. 

For both sequences of phases 0i and 02 the half year 
means were calculated and from the mean values, by lin- 
ear interpolation, the two contiuous mean curves M\{t) 
and M 2 (t) were built. To get the analogue of the Fig. 5 
of BU the distribution of the differences of the all gener- 
ated phases and one of the continuous curve (Mi(t)) was 
computed. In Fig. ^ a short fragment of generated phases 
together with two running means and in Fig [3 the cor- 
responding distribution for all differences are plotted. To 
eliminate the effect of the statistical fluctuations we gener- 



ated 100 independent distributions and in Fig.^the mean 
distribution of these runs are plotted. As we see the means 
are uncorrelated and the distribution is approximately flat 
in its central part. 

1.2. Phase adjustments 

In BU, the original phases for active longitude centers 
were not used straightforwardly, but were modified. The 
authors say "We plot the recovered phases of sunspot clus- 
ters versus time and find that regions migrate in phase as 
rigid structures. When a region reaches = 1, it appears 
again near = 0. In such cases we add an integer to the 
phase and unfold continuous migration of the regions". 
Unfortunately the desription of the actual procedure of 
the described processing step remains somewhat obscure. 
From Usoskin ( 2003 ) we understood that what was done 
was something similar to the following: 

— There was a preset constant, say <50. 

— When particular maximum occured at a phase which 
exceeded 1 — <50 and when one of the maximum phases 
for the next rotation was less than 50, then such sit- 
uations were looked upon as a candidate for a phase 
jump. 

— If the prospective phase jump improved "continuity" 
of the assumed migration paths it was accepted. 

It is important to note that the jumps were not con- 
strained by the type of maxima (main or secondary) . To 
ensure continuity of the migration paths the authors of 
BU allowed swaps between the principal and secondary 
maxima ( "flip- flops" ) . 

To avoid any subjective judgement we applied a similar 
but fully automatic procedure in our simulation program. 
Because the actual value for 5(f) (if it ever existed in numer- 
ical form) was not communicated to as, we tried multiple 
values of it. In a "crossing situation" we constructed all 
possible jumping schemes (no jump at all; largest phase 
jumps; if both can jump, then smaller phase jumps) and 
selected in each situation a scheme with a smallest change 
in phases. Thus, if before crossing point, the phases for 
two maxima were 0i and 02 and after crossing 03 and 04, 
then the selection criterion was the minimization of the 
0i — 03 1 + |02 — 04 1- We believe that the actual procedure 
of the original authors was carried out similarily, even if it 
was not implemeted as a rigorous algorithm but was done 
by manual adjusting. 

We computed analogously to Fig. [3] distribution dia- 
grams for several values of <50 in the range 0.05 — 0.25. 
All diagrams had a characteristic bimodal form with two 
distribution maxima (persistent longitudes!). For presen- 
tation purposes we give here an example with one partic- 
ular parameter value S(f> = 1/18. This is the most natural 
value for the case when the histogram method of activity 
smoothing is used and the number of histogram bins is 
18 (see BU). In Fig. 01 we see a short (approximately one 
solar cycle) fragment of the two mean curves with upward 
drifting phases. In Fig.[5]the one concrete distribution (ef- 
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Fig. 4. Two rows of random phases with phase adjust- 
ments applied. The upward trend is due to the accumula- 
tion of the added full cycles. 
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Fig. 5. Distribution of the differences between phases and 
the mean Mi(t) after adjustment. Crossing parameter 
S<p — 1/18. The bimodal structure of the distribution 
demonstrates that century scale persistence of the active 
longitudes appear even in randomly distributed phases. 



Fig. 7. Fragment of random phases for constrained data. 
The series of maxima were generated by checking against 
minimum allowed distance between main and secondary 
maximum. Crossing parameter S(f> — 1/18 and distance 
parameter A0 — 0.30. The adjustments to follow phase 
drift were also applied. 
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Fig. 8. Phase differences from mean Mi(t) for con- 
strained data. 
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Fig. 6. Mean distribution of the differences for 100 runs. 
Crossing parameter Sip — 1/18. The bimodality in Fig. [5] 
is not an exceptional fluctuation. 



fectively the first from series of 100) is given. The Fig. 
depicts the mean distribution of all runs. As we see, the 
distributions are of two-peaked nature. 

We started from random and uncorrelated pairs of 
phases. How it is then possible that there are now cer- 
tain preferred positions for the random phases? It is not 
hard to see that the selection of phase jumping points 
tends to introduce extra statistical dependencies between 
neigbouring maxima in different rotations. Some of the 
extra large differences (j){ti) — <p(ti + i) can be excluded by 
doing jumps for appropriate phases. As a result overall 
mean of differences between sequential phases tends to be 
less. When these extra correlations and allowed swaps be- 
tween principal and secondary minima are combined the 
double-peaked distribution results. 



1.3. Restrictions from continuity of distributions 

For further discussion we need to introduce the notion 
of circular distance D(^>i,</>2). It is defined as min(|0i — 
02 |j 1 — |</>i — 02 1) an d it takes into account the circular 
nature of the phases (the point 1 is the starting point 
for the next turn). 

Our simulation model has been unrealistic up to this 
point. It is possible that both randomly generated phases 
4>i{ti) and <p2(ti) are equal or very near to each other. For 
real data this can not be so. Depending on the smooth- 
ing and maximum finding method, there is a certain min- 
imal circular distance A0 between two phases of max- 
ima. To simulate this kind of more realistic data we 
generated random phases as was done before. However 
for each phase pair we computed the circular distance 
D(4>i(ti), 02 (ii)) and pairs whose phases were too close 
(D{(pi{ti),(f)2{ti)) <= A0) were excluded from the result- 
ing data set. In this way we allow certain statistical depen- 
dencies between two phases but phase pairs remain totally 
independent (from rotation to rotation). 

A short fragment of data which is checked against min- 
imal A0 and with adjusted phases is given in Fig. (cross- 
ing parameter <50 = 1/18, distance parameter A0 = 0.3). 
The tendency of phases to cluster strongly around mean 
curves is well demonstrated in Fig.|Hl The plot of the mean 
distribution for 100 separate runs (Fig. EJ) shows that we 
are not dealing here with random fluctuations. It is pos- 
sible to make the "persistency" effect even stronger if we 
select certain "optimal" input parameters <50 and A0. 
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Fig. 9. Mean distribution of the phases for distributions 
like in Fig. 



2.5 








2.D 








1 .5 


t % 






1 .O 








0.5 









Fig. 10. Two rows of random phases and corresponding 
mean curves for locally correlated data. Crossing parame- 
ter Sip = 1/18, distance parameter Acf> = 0.3. In this kind 
of simulation even "flip-flop" events can be seen. 
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Fig. 11. Phase differences from mean Mi(t) for locally 
correlated data. 




Fig. 12. Mean distribution of the phase differences for dis- 
tributions like in Fig. ^] 



1.4. Local correlations 

If we return now to the original setup of the generation 
mechanism for the random data sets, we recall that the 
distributions for all Carrington rotations were assumed 
to be statistically independent. This is not of course the 
case for real distributions. It is well known that there are 
somewhat persistent activity regions on the Sun. If we try 
to model this kind of local correlation, the separation of 
the two maxima becomes even more pronounced. 



Look at the Figures^Jthrough^lwhich were obtained 
using locally correlated data sets. The two-peaked nature 
is now very pronounced and corresponding plots tend to 
be quite similar to the plots in the original paper. By fine 
tuning of the parameters it is possible to get a perfect 
match. But this is not our goal here. 

The local correlations were introduced by using the 
following generation scheme. For every new phase to be 
generated, one from the four possible actions was selected 
(with equal probability): 

— generate totally new phase, 

— repeat previous phase, 

— add to previous phase a small amount 5, 

— subtract from previous phase a small amount 5. 

As a result we got a sequence of globally uncorrelated (due 
to the first option) but locally correlated random walk 
(due to the other options) fragments. We are not arguing 
here that this is a very good model for a solar activity, 
our aim is just to show that distributions like these in 
Fig. 5 of BU can be obtained from locally correlated ran- 
dom sequences and that the correlatedness can strengthen 
distribution peaks of preferred longitudes. 

2. Discussion and conclusion 

The analysis above shows that phase distributions of vari- 
ous parameters (maxima in our case) tend to be very sen- 
sitive to hidden dependencies and correlations. It is not 
very easy to see from the first glance that if we look at 
random primary and secondary maxima, then a phase ad- 
justing to follow "rigid patterns" introduces a significant 
hidden statistical dependencies which show up as double 
peaked distribution of phase differences. 

When we combine all three types of dependencies and 
correlations, from phase tracking, from constrained phase 
differences and from real short time correlations, then we 
easily get significantly bimodal diagrams like Fig. 5 in BU. 
The flexibility due to the possibility to swap principal and 
secondary maxima ("flip-flop") and adjust phases ("rigid 
patterns") allows one to amplify every local short range 
correlation into magnificent century scale persistent phe- 
nomena. 

It is important to note that our simulation analysis 
does not rule out that in principle there can be certain 
persistent phenomena in sunspot longitude distributions. 
We just demonstrated that current evidence is not suffi- 
cient. 
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